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Abstract 

DNA tile self-assembly has emerged as a rich and promising primitive for nano-technology. This pa- 
per studies the problems of minimizing assembly time and error rate by changing the tile concentrations 
because changing the tile concentrations is easy to implement in actual lab experiments. We prove that 
setting the concentration of tile Tj proportional to the square root of iV, where iVj is the number of times 
Tj appears outside the seed structure in the final assembled shape minimizes the rate of growth errors 
for rectilinear tile systems. We also show that the same concentrations minimize the expected assembly 
time for a feasible class of tile systems. Moreover, for general tile systems, given tile concentrations, 
we can approximate the expected assembly time with high accuracy and probability by running only a 
polynomial number of simulations in the size of the target shape. 
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1 Introduction 



Considerable modern research in science and engineering has aimed to control smaller and smaller systems 
in many fields, including computer science and material science. As the size of a system approaches the 
molecular scale, precise direct external control becomes prohibitively costly, if not impossible. As a result, 
bottom-up self-assembly has emerged as a rich and promising primitive for nano-technology. In particular, 
DNA has received much attention as a substrate for molecular self-assembly because its combinatorial na- 
ture enables the programming of molecular behaviors by choosing appropriate DNA sequences to encode 
information. In addition, lab techniques for the manipulation of DNA are already well developed. For 
these considerations, DNA self-assembly has been proposed for a variety of applications, e.g., as a means 
to perform computation (4j|22||29j, construct molecular patterns (9]|T2}[T8][T9}|24|[34j, and build nano-scale 
machines |[5j[T0j[T3j|23j|25j[33j. 

DNA tiles which self-assemble according to simple rules have been developed in lab [3T| and mathe- 
matically analyzed based on the combinatorial tile assembly model (aTAM) proposed by Rothemund and 
Winfree p7| . Under this model, there is a set of square tiles with a glue on each of the four edges. Each 
glue has a certain affinity for itself called strength. The self-assembly process starts from a distinguished 
seed structure. Assembly proceeds as tiles attach to the partially assembled structure (initially, just the seed 
structure) one by one when the combined strength of matched glues between a tile and the partial structure 
is at least the temperature of the tile system. Many interesting tile systems have been designed under aTAM, 
including systems that build counters |l]|8j and squares pT][T5|[T7| , perform Turing-universal computa- 
tion [29], and produce arbitrary computable shapes |16[27|. Unfortunately, in laboratory settings, several 
events that aTAM does not model have been frequently observed. These events are referred to as errors 
in the tile self-assembly process. A more realistic stochastic model called the kinetic tile assembly model 
(kTAM) was proposed by Winfree [29| to describe the rates of these errors. The kTAM model calculates the 
rates for various types of attachments and detachments of tiles based on thermodynamics. 

In order to make DNA tile self-assembly practical, there are two important factors that need to be mini- 
mized, namely, the error rate and the time of the assembly process. One approach to reducing the error rate 
of a tile assembly system (6][7][l4][26j[30j is to convert an existing error-prone tile system to a more robust 
tile system that assembles into the same shape or pattern up to scaling. These error correcting techniques 
increase the number of tile types by a multiplicative factor and thus are hard to implement in practice. In 
contrast, it is easy to change the concentrations of tiles. Therefore, it is natural to consider reducing the er- 
ror rate by changing the concentrations of tiles. This approach has been studied using computer simulations 
and lab experiments. However, no closed-form formulas or efficient algorithms for finding the optimal tile 
concentrations have been previously found. It is also natural to consider changing the tile concentrations in 
order to minimize the assembly time. Adleman et al. [2] designed an algorithm to find tile concentrations 
that approximate the minimum expected assembly time within an 0(log n) factor. Goel et al. pi showed that 
if all tiles have equal concentrations, then the expected assembly time is proportional to the longest length 
of a path in the assembly order of the target shape. Also, some studies employed computer simulations to 
characterize the trade-offs between the time and the error rate of an assembly process |6l[30[. 



Our Results On the problem of minimizing the error rate, we formulate the rate of growth errors in terms 
of tile concentrations based on the kinetic tile assembly model. Using our formulation, we show that setting 
the concentration of each tile Tj proportional to the square root of the number of times T{ appears outside 
the seed structure in the target shape minimizes the rate of growth errors. This result holds for all rectilinear 
tile systems (i.e., tile systems that have the same growth directions for all tiles fixed throughout the assembly 



1 



process) as well as many other systems that have been implemented in lab (3j|2TJ . We also have simulation 
results showing that facet errors can significantly affect the accuracy of the optimal tile concentrations 
predicted by our mathematical analysis. On the problem of minimizing the assembly time, we prove that the 
above concentrations for minimizing the rate of growth errors also minimizes the expected assembly time 
for a feasible class of tile systems. Moreover, for general tile systems, given tile concentrations, we show 
that the average assembly time over a polynomial number of simulations in the size of the target shape can 
approximate the expected assembly time with high accuracy and probability. 

The remainder of this paper is organized as follows. Section [2] describes the two tile assembly models 
that we use. Section [3] contains the theoretical results on minimizing the rate of growth errors. Section [4] 
contains the simulation results on growth errors and some discussion on facet errors. Section [5] contains the 
theoretical results on estimating and minimizing the expected assembly time. Section|6]concludes the paper 
with some open problems. 

2 Two Tile Assembly Models 

The Combinatorial Tile Assembly Model The combinatorial tile assembly model was proposed by 
Rothemund and Winfree p7| . It extends the theoretical model of tiling by Wang (28J to include a mecha- 
nism for growth based on the physics of molecular self-assembly. Informally, a tile self-assembly system 
has a set of tiles, each of which is a square with glues of various types on each of the four edges. Two tiles 
will stick to each other if they have compatible glues. Below we present a succinct definition of this model 
with minor modifications for ease of explanation. 

A tile is an oriented unit square with the north, east, south and west edges labeled from some alphabet 
£ of glues. For each tile t, the glues of its four edges are denoted as o"jv(*)> <7e(*)> os(t), and a\y(t)- We 
describe a tile t as the quadruple (<tjv(£), o~E{t), o~s{t),aw(t)). Consider the triple <T, g, r> where T is a 
finite set of tiles, r G Z>o is the temperature, and g is the glue strength function from £ x £ to Z>o. It is 
assumed that for all x, y G £, the inequality x ^ y implies g(x,y) = and there is a glue null E £, such 
that g(x, null) = g(null, x) = for all x G £. A configuration is a map from Z 2 to T\J empty, where 
empty is a special symbol indicating the absence of any tile. 

A tile system is a quadruple T = <T, s, g, r>, where T, g, r are as above and s is a special configuration 
called the seed structure. Let C and D be two configurations. Suppose that there exist some t G T and some 
(x,y) G Z 2 such that D = C except that at (x,y), C(x,y) = null and D(x,y) = t. Let fN,c,t(x,y) = 
g(aN{t),crs(C(x,y + 1)). Informally, fN,c,t(x,y) is the strength of the bond on the north edge of t in 
configuration C. We define fs,c,t(x,y),fE,c,t(x,y) and fw,c,t(x,y) similarly. Then tile t is attachable 
to C at position (x,y) iff fo,t(x,y) = fN,C,t(x,y) + fs,C,t{x,y) + fE,c,t( x 'V) + fw,C,t(x,y) > t. We 
write C — >i D to denote the transition from C to D by attaching a tile to C at position (x, y). Informally, 
C — >t D iff D can be obtained from C by adding a tile t such that the total strength of interaction between 
t and C is at least r. A terminal assembly is a configuration A such that there is no configuration B for 
which A — »t B. 

When a tile t attaches to configuration C at position (x, y), the edges U of t with fu,c,t(x, y) > are 
called the input edges; all other edges are called the output edges. A tile system is rectilinear if there is a 
unique terminal assembly that can be reached starting from the seed structure, each tile t has the same input 
and output edges every time it attaches, and all tiles have the same input and output edges. 

The Kinetic Tile Assembly Model According to the combinatorial tile assembly model, a tile t attaches 
at a position (x, y) in a configuration C iff the total strength fc,t(x, y) of matched glues is at least r, and 
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any tiles that attached never fall off. In practice, tiles may attach with a weaker binding strength, and tiles 
that already attached may fall off. These events can cause the tile system to behave differently from the 
combinatorial tile assembly model. We treat these deviations as errors and try to minimize the probability 
of having these events. In this paper, we use the kinetic tile assembly model proposed by Winfree (29) 
to model the forward and reverse rates, which are the rates at which a tile attaches to and falls off from a 
specific position, respectively. This model computes these rates as functions of thermodynamic parameters 
as follows: 

1. The concentrations of the tiles are held constant throughout the self-assembly process. 

2. The only two reactions allowed are single tiles attaching to and dissociating from a configuration. 

3. The forward rate for tile Tj is kfCi, where kf is a constant and a is the concentration of tile Tj. This 
notation is used throughout this paper. 

4. The reverse rate for a tile t attached to configuration C at position (x, y) to fall off is k r e~ bGse , where 
k r and G se are constants and b = fc,t{ x i v) * s tne tota l strength of matched glues between t and C. 

Here, the parameters kf and k r give the time scale of the self-assembly. The value of G se is determined 
by the binding strength of the sticky ends of DNA tiles. We use c max and c m m to denote the maximum and 
minimum concentrations allowed in the tile system. If one wants the tile system to assemble according to 
the combinatorial tile assembly model most of the time, then the following two conditions need to hold. 
First, if the total binding strength between a tile and the original configuration it just attached to is less than 
r, then the tile must fall off quickly, i.e., 

tv r c "'/'-'max- 

Second, if a tile t is attachable to a position in C, then the forward rate at which it attaches should be greater 
than the reverse rate at which it falls off, i.e., 

kfCxain > k r d 

In practice, since each tile may have a slightly different value for the parameter kf and the strength of each 
glue may vary, one often needs to set kf and k r (by changing an experiment's temperature) such that 

fc/Cmin ^> k r 6 Be . 

For the remainder of the paper, we assume 

k r e ^ ^ 3e 3> kfC max > kfCrxan k r e se . 

We also assume that our seed structure is made by some other processes (e.g., DNA origami |18j) and its 
tiles never fall off. 



3 Minimizing the Error Rate 

In this section, we consider the problem of changing the concentrations of tiles to minimize the failure 
probability (i.e., error rate) for a rectilinear tile system. There are three types of errors in tile self-assembly. 



A growth error refers to an incorrect tile attaching at a position instead of the correct tile [30]. A facet error 
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refers to an incorrect tile attaching at a position where no tile is supposed to attach (6J. A nucleation error 



refers to single tiles attaching to each other to form a lattice without the seed structure |20|. In this section, 
we only consider minimizing growth errors. It has been shown [30} that if one slows down the growth of 
the boundary tiles of the target shape, the growth front will be roughly triangular. In this situation, growth 
errors are the dominant type of errors among the above three types. 

We want to compute the probability that a tile Tj causes a growth error by attaching at a position (x, y) 
where only T can attach with total binding strength at least r. First, Tj can attach at that position at rate 
kjCi. Once T is attached, the probably of it falling off is negligible since /c/c mm >> k r e~ rGse . Second, 
Tj can attach at that position at rate kfCj. When Tj attaches, it can fall off at rate k r e~^ T ~ m ' Gae , where m 
is the total strength of the mismatched glues between T and Tj on their input edges. Tj can also get locked 
in place and cause an error due to the attachment of one or more adjacent tiles. The rate r at which Tj gets 
locked in place may vary with the features in the partially assembled shape near position (x, y) such as long 
facets. In this paper, we assume that r is the same for all positions (x, y) and tiles t. The allowable reactions 
related to T and Tj are summarized in Figure[TJ From the above description of reaction rates, we know that 
at a given position (x, y) where tile T is supposed to attach, the probability of having a growth error caused 
by Tj is ZLeij. Here, = r+fcr£ _ ( r T _ m)Gse , which is roughly at the order of e ~ mGae since r < 2k f c mSLX . 
Therefore, at position (x,y), the total probability of a growth error is 
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Figure 1: A Markov chain describing attachments of T,; and Tj, where C indicates a correct tile. 

For a self-assembly process, the error rates at different positions depend on each other. However, if one 
wants to have a high probability of success, one almost always needs to set the experimental condition such 
that the error rate at each position is much smaller than l/n. In this case, minimizing the sum of error rates 
over all positions is a good approximation of minimizing the actual overall error rate of the tile assembly 
system. Thus, in the remainder of this section, we will minimize 

£^(^% 0) 

i 

where iVj is the number of positions outside the seed structure to which Tj is supposed to attach. 
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Theorem 1 For a rectilinear tile system with a unique terminal assembly, the error rate (i.e., probability of 
failure) is minimized when the concentration of each tile T{ is proportional to \/Ni, where Ni is the number 
of times tile T, appears outside the seed structure in the correct terminal assembly of the tile system. 

Proof: From Equation [TJ we can scale the tile concentrations Cj without loss of generality such that 
Y^i c-i = 1, and we need to solve the following minimization problem: 

Minimize £ Nl {^^), 

t 

subject to = 1. 

i 

The Lagrange multiplier for this minimization problem is 

a = ^(%^) + a£,-i). 



We need to solve 



and 

Simplifying Equation |2| we obtain 



P = - Ni ip*gM ) + + A = for all- 

oci cf rr? Cj 



(2) 



E c * = L 



^(-Ni^g- +Nj-^) + A = far all i, 



and consequently the error rate is minimized when 



IK 



□ 

Two points about the proof of Theorem [T] are worth noticing. Firs, the error rate only depends on the 
ratio between the tile concentrations. Therefore, the error rate is minimized when the concentration of T« is 
proportional to \fNl even when we vary each Cj between c max and c m \ n . Second, the same proof can apply 
to all tile systems that satisfy = eji for all i, j. Hence Theorem[T]is valid for many other systems already 
implemented in lab such as zig-zag ribbons pT| and counters seeded by origami [3]. Moreover, Theorem [T] 
still applies if we add either a uniform proofreading scheme | [30) or a snaked proofreading scheme [ 6 ] to the 
tile system. However, for a proofreading system to work, we need kf-oi to be sufficiently close to k r e~ rGae 
for all tiles Tj, which is false in the parameter range that we use in this paper. 
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4 Simulation Results for Theorem Q] 



We used a software called xgrow [32] to simulate four tile systems to determine their error rates under 
different tile concentrations. To obtain a good estimate of the error rate of a tile system, we would choose 
our parameters such that errors can be frequently observed. However, in most tile systems, if we use such 
parameters, we will reach some configuration very different from the terminal assembly predicted by the 
combinatorial tile assembly model. Since our prediction of the optimal tile concentrations depends on the 
terminal assembly, we made a design decision to perform simulations on tile systems for which each error 
only affects one position of the terminal assembly. 

We simulated four tile systems named as A\, A%, B\, and B2. Each of the four systems operates at 
r = 2 and only has two tiles X and Y shown in Figure pt a) beside the seed structure. The only difference 
between the four systems is their seed structures. The seed structures of tile systems A\ and A2 are shown 
in Figure |2jb). The lengths of their seed structures are adjusted such that Nx : Ny = 25 : 1 and 64 : 1 for 
A\ and A%, respectively, where Nx and Ny are the numbers of positions X and Y appear in the terminal 
assembly. The seed structures of tile systems B\ and B2 are shown in Figure|2jc). The lengths of their seed 
structures are also adjusted such that Nx '■ Ny = 25 : 1 and 64 : 1 for B\ and B2, respectively. These 
systems are rectilinear with all tiles having their input edges on the south and east edges. Since tiles X and Y 
have the same output edges, when an error happens, the error only affects the position where the erroneous 
tile is attached. The unique terminal assemblies and example configurations generated by simulations of tile 
systems A\ and B\ are shown in Figures [3] and [4j respectively. Theorem [T] predicts that the rate of growth 



errors is minimized when cx 
A2 and f?2- 



cy = 5 : 1 for systems A\ and B\, and when cx '■ cy = 8 : 1 for systems 
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Figure 2: (a) Tiles X and Y, where all glues have strength 1. (b) An L-shaped seed sttucture for systems A\ and A%. 
(c) A seed structure with two vertical facets and one horizontal facet for systems Bi and B 2 - 



The simulation results are shown in Figure [5] In systems A\ and A2, the optimal tile concentration 
ratios are 2.5 : 1 and 3:1, respectively. In systems B\ and B2, the optimal tile concentration ratios are 
roughly 7.5 : 1 and 15 : 1, respectively. The major reason causing these deviations from the predictions 
made by Theorem [T] is the facet errors. Since tiles X and Y both have glue on the south edge, having 
a long horizontal facet may introduce a large number of facet errors. For systems A\ and A2, notice that 
long horizontal facets are generated because tile Y (colored yellow) has lower concentrations and grows 
slower than X. An example configuration for system A\ that demonstrates these horizontal facets is shown 
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Figure 3: The left figure is the unique terminal assembly for system A\. The right figure is an example configuration 
for system A\ generated by an xgrow simulation. 




Error Rates vs. Tile Concentrations for Systems A1 and A2 




ratio between Cx and Cy 



Error Rates vs. Tile Concentrations for Systems B1 and B2 




ratio between Cx and Cy 



Figure 5: Plots of error rates vs. the ratios between tile concentrations. Each data point represents m = 
simulations. The simulations use G se — 9 for systems A\ and A2, G se = 11 for systems B\ and B2, cx+cy 
and kf = k r . Error bars show two standard deviations of the errors, computed using a = cumulation /v^- 
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in Figure [3] Such undesirable facets become longer and more when we increase the ratio between cx and 
cy. Therefore, the actual optimal tile concentration is biased towards having more Y than predicted by 
Theorem [T] For systems B\ and B2, each terminal assembly is separated into a left portion and a right 
portion by the seed structure, as shown in Figure [4] Horizontal facets can only be generated in the left 
portion, where all tiles should be X. Hence, we can reduce facet errors by decreasing the concentration 
of tile Y and thus the actual optimal tile concentration is biased towards having fewer Y than predicted by 
Theorem [TJ 



5 Minimizing the Expected Assembly Time 

This section assumes that only the correct tiles can attach and any tile that already attached never falls off. 
We minimize the expected assembly time by varying the tile concentrations. 

Theorem 2 Consider any tile system with the four properties that 

1. tiles can only attach one by one, 

2. only the correct tiles can attach, 

3. any tile that already attached never falls off, and 

4. there is a unique terminal assembly. 

Assume that the total tile concentration is ^ ■ Cj = 1. Setting Ci = v » x/ ^% r minimizes the expected assembly 
time of the tile system. 



Proof: From the kinetic tile assembly model |29|, the expected time for a tile Tj to attach to a location 
(x, y) is j-^—. By the four properties of the given tile system in the theorem, the expected assembly time is 

itf 2^2i ^t~> where Ni is the number of times Tj appears outside the seed structure in the terminal assembly. 

We want to minimize the sum J2 i ^ subject to 2~2i (% = 1. The Lagrange multiplier for this minimization 
problem is 



Solving the equations 



and 



= -^ + A = foralU 

OCi cf 



% 



we obtain 



□ 

In settings that are more general than Theorem [2] assumes, the optimal tile concentrations to minimize 
the expected assembly time may significantly deviate from the Cj's determined in Theorem [T] Lemma [3] 
describes an example of such deviation. 
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Lemma 3 Consider a tile system for which properties [2] through [4] in Theorem [2] hold but property [7] is 
replaced by allowing tiles to attach in parallel whenever they can. Also, as in Theorem [2] assume that the 
total tile concentration is c, = 1. Further assume that in the final assembly, there are N tiles not in the 
seed structure, and all N tiles can attach to the seed structure directly and in parallel. Then the optimal tile 
concentrations that minimize the expected assembly time must satisfy C{ > k ^ N , where k is the number of 
tile types. 

Proof: A key point of this proof is that the attachment of tiles to all N positions outside the seed structure in 
the terminal assembly can happen in parallel. If all tiles have the same concentration 1/k, then the expected 
assembly time is k In N. If any tile has concentration lower than k ^ N , then the attachment of that single tile 
non-optimally takes expected time more than k In N. Thus, when the expected assembly time is minimized, 
all tile concentrations must be at least k ^ N - □ 
For general tile assembly systems, we do not know how to analytically find the optimal tile concentra- 
tions to minimize the expected assembly time. However, we show in Theorem[5]below that given a set of tile 
concentrations, only a polynomial number of simulations is required in order to approximate the expected 
assembly time with high accuracy and probability. 

Lemma 4 Consider any tile system for which properties [2] through |4] in theorem [2] hold but there is no 
assumption on whether tiles can attach only one by one or in parallel. If the assembly process of the tile 
system takes expected time S, then for any e > 0, the average of the assembly times over 485 2 ^ simulations 
of the assembly process will be between S — e and 5 + e with probability at least 3/4. 

Proof: We only need to bound the variance of the assembly time of a simulation to show that the average of 
the assembly times of simulations converges fast. Suppose the expected time for the final shape to assemble 
is 5. From Markov's inequality, the probability of having the time of a simulation greater than 25 is at most 
0.5. Also, the configuration at time 25 contains the seed structure because tiles in the seed structure do not 
fall off. Therefore if the assembly process is not finished at time 25, then starting from time 25, the expected 
time for the process to finish is less than 5. Thus, the probability that a simulation takes more than 2kS 
time is at most 2~ k , and the variance of the assembly time of a simulation is at most ^ ^ (2iS) 2 = 245 2 . 
Thus, for any given e, if we run 485 2 \ simulations, then the probability that the average of the assembly 
times of the simulations lands between 5 — e and 5 + e is at least 3/4. □ 

Theorem 5 Consider any tile system for which properties [2] through |?] in Theorem [2] hold but there is no 
assumption on whether tiles can attach only one by one or in parallel. Let N be the number of positions 
outside the seed structure in the terminal assembly. If the assembly process of the tile system takes expected 
time S, then for any e > 0, the average of the assembly times over 0(n 4 — \— ) simulations of the assembly 

process will be between S — e and 5 + e with probability at least 3/4. 

Proof: Goel et al. [8] showed that if all tiles have concentrations c, then the expected assembly time 
is Q(d/c), where d is the length of the longest path in the assembly order of the terminal assembly. 
Since reducing tile concentrations only slows down the assembly process, the expected assembly time 5 
is 0(d/c m i n ) = Q(n/c m - m ). The theorem follows from this upper bound on 5 and Lemma |4j □ 

6 Further Research 

In Section [3] we gave closed-form formulas to minimize the growth errors by varying the concentration of 
each tile. In Section [4| we found in simulations that facet errors are also an important factor that needs to 
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be considered in lab implementations. At the theoretical level, it is open to find closed-form formulas or 
efficient algorithms to minimize the facet errors by varying the tile concentrations. 

In Section [5] we gave closed-form formulas to minimize the expected assembly time for a feasible 
class of tile systems by varying the tile concentrations. For general tile systems, the best known algorithm 
can compute an O (log n) -approximation of the minimum expected assembly time [2]. It is of interest 
to determine whether one can compute the precise minimum expected assembly time or an estimate with 
a better approximation factor than O(logra). Given tile concentrations, we showed that simulations can 
accurately predict the expected assembly time with high probability. The time it takes to run the required 
simulations is polynomial in the size of the terminal assembly, not the tile system itself. It would be useful 
if one can approximate the expected assembly time just by analyzing the tile system and some succinct 
features of the terminal assembly (e.g., the number of times each tile appears outside the seed structure in 
the terminal assembly) in time polynomial in the size of the tile system. 

References 

[1] L. Adleman, Q. Cheng, A. Goel, and M.-D. Huang. Running time and program size for self-assembled 
squares. In Proceedings of the Thirty-third Annual ACM Symposium on Theory of Computing, pages 
740-748, 2001. 

[2] L. Adleman, Q. Cheng, A. Goel, M.-D. Huang, D. Kempe, P. Moisset de Espans, and P. Rothemund. 
Combinatorial optimization problems in self-assembly. In Proceedings of the Thiry-Fourth Annual 
ACM Symposium on Theory of Computing, pages 23-32, 2002. 

[3] R. Barish, R. Schulman, P. W. K. Rothemund, and E. Winfree. An information-bearing seed for 
nucleating algorithmic self-assembly. In Proceedings of the National Academy of Sciences, volume 
106, pages 6054-6059, 2009. 

[4] R. D. Barish, P. W. K. Rothemund, and E. Winfree. Two computational primitives for algorithmic 
self-assembly: Copying and counting. Nano Letters, 5(12):2586-2592, 2005. 

[5] J. Bishop and E. Klavins. An improved autonomous DNA nanomotor. Nano Letters, 7(9):2574-2577, 
2007. 

[6] H.-L. Chen and A. Goel. Error free self-assembly using error prone tiles. In Proceedings of the Tenth 
International Meeting on DNA Based Computers, pages 62-75, June 2004. 

[7] H.-L. Chen, C. Luhrs, and A. Goel. Dimension augmentation and combinatorial criteria for efficient 
error-resistant DNA self-assembly. In Proceedings of the Nineteenth Annual ACM-SIAM Symposium 
on Discrete Algorithms, pages 409-418, 2008. 

[8] Q. Cheng, A. Goel, and P. Moisset. Optimal self-assembly of counters at temperature two. In Pro- 
ceedings of the First Conference on Foundations of Nanoscience: Self-Assembled Architectures and 
Devices, pages 62-75, 2004. 

[9] H. Dietz, S. Douglas, and W. Shih. Folding DNA into twisted and curved nanoscale shapes. Science, 
325:725-730, 2009. 

[10] B. Ding and N. Seeman. Operation of a DNA robot arm inserted into a 2D DNA crystalline substrate. 
Science, 384:1583-1585, December 2006. 



11 



[11] D. Doty. Randomized self-assembly for exact shapes. In Proceedings of the 50th Annual IEEE Sym- 
posium on Foundations of Computer Science, pages 85-94, 2009. 

[12] S. Douglas, H. Dietz, T. Liedl, B. Hogberg, F. Graf, and W. Shih. Self-assembly of DNA into nanoscale 
three-dimensional shapes. Nature, (459):414-418, 2009. 

[13] S. Green, J. Bath, and A. Turberfield. Coordinated chemomechanical cycles: a mechanism for au- 
tonomous molecular motion. Physical Review Letters, (101):238101, 2008. 

[14] S. Sahu J. Reif and P. Yin. Compact error-resilient computational DNA tiling assemblies. In Proceed- 
ings of the Tenth International Meeting on DNA Based Computers, pages 293-307, 2004. 

[15] M.-Y. Kao and R. Schweller. Reducing tile complexity for self-assembly through temperature pro- 
gramming. In Proceedings of the Seventeenth Annual ACM-SIAM Symposium on Discrete Algorithms, 
pages 571-580, 2006. 

[16] M. Lagoudakis and T. LaBean. 2D DNA self-assembly for satisfiability. In Proceedings of the 5th 
DIMACS Workshop on DNA Based Computers in DIMACS Series in Discrete Mathematics and Theo- 
retical Computer Science, volume 54, pages 141-154, 1999. 

[17] P. Rothemund and E. Winfree. The program-size complexity of self-assembled squares (extended 
abstract). In Proceedings of the Thirty-Second Annual ACM Symposium on Theory of Computing, 
pages 459-468, 2000. 

[18] P. W. K. Rothemund. Folding DNA to create nanoscale shapes and patterns. Nature, (440):297-302, 
March 2006. 

[19] P. W. K. Rothemund, N. Papadakis, and E. Winfree. Algorithmic self-assembly of DNA Sierpinski 
triangles. PLOS Biology, 2:424-436, 2004. 

[20] R. Schulman and E. Winfree. Programmable control of nucleation for algorithmic self-assembly. In 
Proceedings of the Tenth International Meeting on DNA Based Computers, pages 319-328, 2004. 

[21] R. Schulman and E. Winfree. Self -replication and evolution of DNA crystals. In Proceedings of the 
Fifth European Conference on Artificial Life, pages 734-743, 2005. 

[22] G. Seelig, D. Soloveichik, D. Zhang, and E. Winfree. Enzyme-free nucleic acid logic circuits. Science, 
314:1585-1588, 2006. 

[23] W.B. Sherman and N.C. Seeman. A precisely controlled DNA bipedal walking device. Nano Letters, 
4:1203-1207,2004. 

[24] W. M. Shih, J. D. Quispe, and G. F. A. Joyce. A 1.7-kilobase single-stranded DNA that folds into a 
nanoscale octahedron. Nature, (427) :6 18-621, 2004. 

[25] J.-S. Shin and N.A. Pierce. A synthetic DNA walker for molecular transport. Journal of American 
Chemistry Society, 126:10834-10835, 2004. 

[26] D. Soloveichik, M. Cook, and E. Winfree. Combining self-healing and proofreading in self-assembly. 
Natural Computing, (7):203-218, 2008. 



12 



[27] D. Soloveichik and E. Winfree. Complexity of self-assembled shapes. SIAM Journal on Computation, 
36:1544-1569,2007. 

[28] H. Wang. Proving theorems by pattern recognition ii. Bell Systems Technical Journal, 1961. 40:1-42. 

[29] E. Winfree. Algorithmic Self-Assembly of DNA. PhD thesis, California Institute of Technology, 
Pasadena, 1998. 

[30] E. Winfree and R. Bekbolatov. Proofreading tile sets: Error correction for algorithmic self-assembly. 
In Proceedings of the Ninth International Meeting on DNA Based Computers. Madison, Wisconsin, 
June 2003. 

[31] E. Winfree, F. Liu, L. Wenzler, and N. Seeman. Design and self-assembly of two-dimensional DNA 
crystals, 6 pages. Nature, (394):539-544, Aug 1998. 

[32] E. Winfree et al. The xgrow simulator. http://www.dna.caltech.edu/Xgrow/xgrow-Www.html. 

[33] B. Yurke, A. Turberfield, A. Mills Jr, F. Simmel, and J. Neumann. A DNA-fuelled molecular machine 
made of DNA. Nature, (406): 605-608, Aug 2000. 

[34] Y. Zhang and N. Seeman. Construction of a DNA-truncated octahedron. Journal of American Chemical 
Society, 11 6(5): 1661, 1994. 



13 



